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Abstract 

Consider ordinary bond percolation on a finite or countably infinite 
graph. Let s, t, a and b be vertices. An earlier paper [3] proved the 
(nonintuitive) result that, conditioned on the event that there is no 
open path from s to t, the two events "there is an open path from s to 
a" and "there is an open path from s to 6" are positively correlated. 
In the present paper we further investigate and generalize the theorem 
of which this result was a consequence. This leads to results saying, 
informally, that, with the above conditioning, the open cluster of s 
is conditionally positively (self-)associated and that it is conditionally 
negatively correlated with the open cluster of t. 

We also present analogues of some of our results for (a) random- 
cluster measures, and (b) directed percolation and contact processes, 
and observe that the latter lead to improvements of some of the results 
in a paper of Belitsky, Ferrari, Konno and Liggett (1997). 



1 Introduction and results for ordinary per- 
colation 

This paper is concerned with positive and negative correlation and the stronger 
notion of positive association. Recall that events A, B (in some probability 
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space) are positively correlated if Pr(AB) > Pt(A)Pt(B), and negatively 
correlated if the reverse inequality holds. Positive association will be defined 
below (following Theorem 1.3). 

We begin in this section with results for ordinary bond percolation. Our 
original motivation here (and for the present work) was Theorem 1.4. 

We then consider extensions to the random cluster model (Section 2) and 
to percolation on directed graphs, together with applications to the contact 
process (Section 3). 

A few words about proofs may be in order. The approach given for 
percolation in Section 1 is similar to that of [3] (see the proof of the present 
Theorem II .lj) . This approach does not seem applicable to the random cluster 
model, and Section 2 takes a completely different route, based on Markov 
chains, to extend the results of Section 1 to this more general setting. We 
also describe, in Section l2~2*l a different way of getting at some of the random 
cluster results. This is based on a connection with the "fuzzy Potts model," 
and is included here despite handling only a subset of what's covered by the 
Markov chain approach, because we think the relevance of the connection 
is interesting. The results for "directed percolation" in Section |3] can again 
be obtained using either the approach of Section ^ or the Markov chain 
approach of Sectional Here we have tried to keep the discussion brief, mainly 
indicating those points where the generalization from what came before may 
not be entirely obvious. 

Consider bond percolation on a (finite or countably infinite, locally finite) 
graph G = (V,E), where each edge e is, independently of all other edges, 
open with probability p e and closed with probability l—p e . For a,b G V the 
event that there is an open path from a to b is denoted by a <-> b, and the 
complement of this event by a ^ b. For X, Y C V we write X <y4 Y for the 
event {x y Wx G X, y G Y}. 

In an earlier paper j3] we showed that, for any vertices s, t, a, b, 

Pr(s <-> a, s <-> b \ s */* t) > Pr(s «-> a \ s */* t) Pr(s «-> b \ s */* t). (1) 

This was a consequence (really a special case) of Theorem 1.2 of to which 
we will return below. 

Here we show, among other results, a sort of complement of (flj). viz. 

Pr(s a, t <-> b\s </> t) < Pr(s «-> a \ s </> t) Pr(t «-> b | s </> t). (2) 
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In this section we will prove the quite intuitive (j2J) by way of a generalization 
of the not very intuitive Before giving this generalization, we need some 
further definitions and notation. 

Let s be a fixed vertex. By the open cluster, C s , of s we mean the set 
of all edges which are in open paths starting at s. As in [3] we define, for 
icy, the event 

R x ■= { S </> X} = {s */> x Vx G X}. 

Let Q = {0, 1} E be the set of realizations; elements of Q will typically 
be denoted u. Recall that an event A is increasing (really, nondecreasing) if 
uo' > uj G A implies u' G A. We also say that A is increasing and determined 
by the open cluster of s if uj G A and C s {yj') D C s (u) imply uj' G A. (Note 
that such an event is increasing in the sense above.) A simple example of 
such an event is {s «-> a}. 

The following statement is a natural generalization of Theorem 1.2 of [3]. 

Theorem 1.1. Let A and B be increasing events determined by the open 
cluster of s. Then for all X, Y C V \ {s}, 

Pt(AR x )Pt(BR y ) < Pr(ABR XnY )Pr(R X[jY ). (3) 

Remark. Theorem 1.2 in jB] is the special case where each of A, B is of the 
form {s <-> w Vw G W} for some W C V. The proof of the present more 
general result is almost the same and we present it in a slightly abbreviated 
form, emphasizing the parts which need extra attention because of the gen- 
eralization. (One may say that the key idea (in both cases) is generalizing 
from statements like (0) to the form (J3J), which supports an inductive proof.) 

Proof. We give the proof for finite G; the infinite case then follows by stan- 
dard limit arguments. The proof is by induction on the number of vertices. 
When G has only one vertex, the result is obvious; so we suppose, for some 
n > 1, that the result holds for graphs with at most n vertices, and consider 
G with n + 1 vertices. 

With notation as in the theorem, it is easy to see that there is an event 
A C A with the following properties: it is increasing and determined by C s ; 
it does not depend on 

E x := {e G E : e n X ^ 0} 
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(that is, if u' e = u e for all e (jL Ex, then w 6 A iff w' 6 A); and, finally, 
A Rx = A Rx- A similar remark holds for B and Y. So we may assume that 
A does not depend on Ex and B does not depend on Ey. 

If XnY = 0, the r.h.s. of © is Pr(v4E) Pr(i? Xu y), and two applications 
of the FKG Inequality give the result: 

Pt(AR x )Pt(B R y ) < Pr{A) Pr(5) Pr(i? x ) Pr(i? y ) < Pr(A B) Pt(R X uy) 

(4) 

(note -Rxuy = RxRy)- 

Now suppose Z:=lnF^0. Let N be the set of all vertices outside Z 
with at least one neighbor in Z. Let S be the (random) set of those vertices 
of TV connected to Z by at least one open edge. We have 

Pr(ARx) = J^Pr(S = S) Pt(AR x \S), 

s 

where the sum is over S C iV and we write Pr(-|5') for Pr(-|S = S). Similar 
expressions hold for the other terms in ©. Moreover, clearly, 

Pr(S = S) Pr(S = T) = Pr(S = S n T) Pr(S = S U T) V5, T C N. 

So according to the Ahlswede-Daykin ("Four Functions") Theorem (pP or 
e.g. [4 ), © will follow if we show that, for all S,T C JV, 

Pr(Ai? x |5) Pr(B/2 y |T) < Pr(A 5 i? Xny |5 n T) Pr(i? Xuy |5 U T). (5) 

Now it is easy to see that, for any set of vertices W D Z, and any event 
D that does not depend on E w , 

Pi{D R W \S) = Pt'(D R {w \z)us), (6) 

where Pr' refers to the induced model on the graph G' obtained from G by 
removing Z. (Strictly speaking, the D on the r.h.s. of (jUJ) is not the same as 
that on the l.h.s., since it is a subset of {0, \} E \ E z rather than {0, 1} E ; but 
since D does not depend on Ew (and hence not on Ez), the two events are 
essentially the same, so we ignore the irrelevant distinction.) 
Applying © to each of the four terms in (J3J), we have 
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Pt{AR x \S)Pt{BR y \T) = Pt'{AR {x \ z)uS )Pt'(BR (yV)ut ) 

< Pr (ABR(( X \z)us)n((Y\z)uT)) Pr ' (R(x\z)usu(y\z)ut) 

< Pi'(ABR((xnY)\z)u(snT)) Pr'(i2((xuK)\Z)u(SuT)) 
= Pr(ABfl XnY |S'nr)Pr( J R xuy |5 , ur), 

where the first inequality follows from our inductive hypothesis (applicable 
since G' has fewer vertices than G), and the second from 

((x \z)us)n ((Y \z)ut)d ((x n Y) \ z) u (S n T) 

and (X \ Z) U 5 U (V \ Z) U T = ((X UF)\Z)U(SU T). □ 
In particular we have the promised generalization of (JTJ: 

Theorem 1.2. For s,A,B and X as in Theorem W. 11 

Pr(A 5 | s <y4 X) > Pr(A | s </> X) Pr(5 | s X). (7) 

Proof. Take K = X in Theorem ll.il □ 
Remarks 

1. It is easy to see that Theorem ll.2l is equivalent to the special case where 
|X| = 1. (To reduce to this, simply identify the vertices of X, retaining all 
edges connecting them to V \ X (edges internal to X may be deleted, but 
are anyway irrelevant).) We have used the present form both because it will 
be convenient for the proof of Theorem 11.51 and because it is natural from 
the point of view of the contact process application in Section 01 Similarly, 
we could replace s in all results of this section, and t in Theorems 11.41 and 
11.51 by sets of vertices. The same easy equivalence holds for the directed 
graph results of Section El but in the case of the random cluster measures of 
Section |21 the more general statements, while still true, do not seem to follow 
in the same way from their specializations. 

2. The derivation of Theorem 11.21 may give the impression that it is less 
general than Theorem 11.11 but in fact the two are equivalent. To see this, 
first note that (consideration of appropriate complementary events shows 
that) Theorem 11.21 also holds when A and B are both decreasing rather than 
increasing, while the inequality (JJj) reverses if one of A, B is increasing and 
the other decreasing. Thus for A, B, X, Y as in Theorem 11.11 Theorem 11.21 
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implies that conditioned on RxnY, each of the pairs (A,R x \y), (B,Ry\x) 
is negatively correlated, while each of (A, B), (R x \y,Ry\x) is positively 
correlated. So, writing Pr' for our percolation measure conditioned on RxnY, 
we have (compare (jlj)) 

Pi' (A R X \ Y ) Pi' (BR Y \ X ) < Pt'(A)Pt'(R x \ y )Pv'(B)Pt'(R Y \ x ) 

< Pr'(AB)Pr'(R x \ Y RY\x), 

which is equivalent to @. 

It will be helpful to have the "functional extension" of Theorem 11.21 

Theorem 1.3. For s, X as in Theorem \1.2\ and f,g bounded, increasing, 
measurable functions of C s , 

E[/ g\s ^ X}>E[f\s X}E[g\s X]. 
The inequality is reversed if one of f,g is increasing and the other decreasing. 

Proof. This is a standard (and easy) reduction. We omit the argument for 
(a) and note that (b) is (a) applied to the pair (/, — g). □ 

Recall that a collection of random variables {o~i : i G T}, with V a finite 
or countably infinite index set and the aVs taking values in {0, 1} (or some 
other ordered set), is said to be positively associated if for any two functions 
/, g of the crj's that are either both increasing or both decreasing (and, in 
case T is infinite, measurable), one has Efg > EfEg. The simplest non-trivial 
example is when the aVs are independent (Harris' inequality). 

If we define a random subset W of a set T to be positively associated if the 
collection {r](a) = l{ a £W} '■ a G T} is positively associated, then Theorem 
11.31 savs that the open cluster of s is conditionally positively associated given 
the event {s */* X}. We will see further, similar examples later. 

Positive association is often derived from the FKG Inequality, which gen- 
eralizes Harris' inequality and says that positive association holds for mea- 
sures (on {0, 1}™, say) satisfying the "positive lattice condition" (also called 
"FKG lattice condition"), viz. 

/x(cr)/i(r) < n(a A r)//(cr V r) (8) 
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(where (er A r) x and (er V r) x are the minimum and maximum of a x and t x ). 
The positive lattice condition is much stronger than positive association. 
It says that the conditional probability that o~ x — 1, given the values of 
a y , y 7^ x, is increasing in those values. 

Let us also recall here that for measures v and v' on {0, l} n (or some other 
partially ordered set), v stochastically dominates v' [y >- u') if f(f) > v'(f) 
for every increasing function / (where v(f) is the expectation of / w.r.t. v). 

As suggested earlier, we do not see any good reason to expect something 
like Theorem 11.21 (For instance, as noted in [3] , it is easy to see that the 
analogous statement with s t in place of s +A t is false.) Nonetheless, it 
implies the following intuitively more natural statement, which says, infor- 
mally, that conditioned on nonexistence of an open (s, t)-path, the clusters 
C s and C t are negatively correlated. 

Theorem 1.4. Let s andt be (distinct) vertices, and f and g bounded mea- 
surable increasing functions of C s and C t respectively. Then 

E[fg\s*/>t]<E\f\s*/>t]E\g\s</>t]. 

Note that (J2J) is the special case where / is the indicator of the event {s «-> a}, 
and g that of the event {t <-> b}. 

We have stated Theorem II .41 above largely because, as mentioned earlier, 
it was the original motivation for this work; but the next statement, which 
contains Theorem 11.31 as well as Theorem 11.41 seems to be the correct level 
of generality here. 

Theorem 1.5. Let s and t be (distinct) vertices, and f and g bounded, 
measurable functions of (C s , Ct), each increasing in C s and decreasing in Ct- 
Then 

E[fg\s^t}>E[f\s^t]E[g\s^t}. (9) 

In other words, on {s */* t} we have positive association of all the r.v.'s 
l{eec s } an d l{e£c t }- (Note: here and often in what follows, we use "on Q" 
to mean "conditioned on (the event) Q") 

Proof of Theorem \l.b\ As in the case of Theorem 1.1, it is enough to prove 
this for finite G. 
We have 

E[fg\s^t] = J2 Pr(C* = W\s</>t)E[fg\C. = W], (10) 
w 
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where we may restrict to W containing no (s, £)-path. Write W for the union 
of W and its "boundary" ; that is, W consists of all edges having at least one 
vertex in common with some edge of W. 

When we condition on {C s = W}, f and g become decreasing functions 
of Ct, and the (conditional) distribution of C% is the same as that for the 
restriction of our percolation model to the graph obtained from G by deleting 
all edges in W. Thus (on {C s = W}) f,g are decreasing functions of the 
independent r.v.'s (u e : e E E \ W), and by Harris' inequality we have 
E[fg\C s = W}>E[f\C s = W]E[g\C s = W}. 

On the other hand, the conditional distribution of C t given {C s = W} is 
stochastically decreasing in W (to couple these distributions, choose all u^s 
independently according to their p e 's and then for conditioning on {C s = 
W} simply ignore those u e 's with e E W); so in particular E[/|C S = W) 
and E[^|C S = W] are increasing functions of W, and it then follows from 
Theorem 11.31 that the right hand side of (fTU|) is not less than 



^Pr(C s = W | s <A t) E[f\C s = W}\ ^[Pr(C s = W \ s </> t) E[g\C s = W] 

K W / \ W 

= E[f\s^t] E[g\s^t\- 

so we have Q. □ 
As just shown, Theorem 11.41 follows easily from Theorem 11.31 While one 
might expect a similar proof (or some proof) of the reverse implication, we 
do not see this. In Section 2 we will (as mentioned earlier) take a completely 
different approach which, even for the more general class of random-cluster 
measures, gives Theorems 11.31 and 11.41 "simultaneously" . 



2 Random- cluster measures 

2.1 Definitions and a Markov chain proof 

A well-known generalization of the bond percolation model is the random- 
cluster model (RCM) introduced by Fortuin and Kasteleyn circa 1970. (See 
e.g. Section 13.6, [7J for additional background and references.) 

Let G = (V, E) be a finite graph. In addition to the parameters p e , e £ 
E of the ordinary bond percolation model, the random-cluster model is 
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equipped with a positive parameter q. To avoid trivialities we assume that 
< p e < 1 for all e G E. The random- cluster measure (r.c.m.) with the 
above parameters on Q = {0, 1} E is then given by 

<f q (u) (= <PgM) OC ^ J] p e n (1-Pe), " G fi, (11) 

ee_B:oj e =l eG_E:a; e =0 

where k(w) is the number of connected components in u, and, as usual, 
f(uj) oc g(u;) means f(cu) = Cg{u) for some (positive) constant C. (For the 
present discussion we regard the p e 's as given once and for all, and omit them 
from our notation.) 

Thus q — 1 gives the ordinary bond percolation model. We have, in spite 
of serious attempts, not been able to adapt the approach of Section 1 to q > 1. 
(We do not consider q < 1, for which the correlation properties of the model 
are quite different). Here we take a different, "dynamical" approach, based 
on the introduction of a Markov chain whose states are pairs of clusters (this 
is not the only possibility; see the remark following the proof of Theorem 12 .lj) 
which converges to a measure (on pairs of clusters) corresponding to (fTTj). 
and for intermediate stages of which the correlation properties we are after 
can be derived from known properties of the RCM. 

For the following extension of Theorem 11.51 to the RCM we replace the 
vertices s and t by sets S and T, recalling that the remark following Theo- 
rem EH regarding the easy reduction from sets to singletons is not valid here. 
Extending our earlier notation, we use Cs for the set of edges belonging to 
open paths starting at vertices of S. 

Theorem 2.1. Consider a distribution (fTTj) with q > 1. Let S and T be dis- 
joint sets of vertices, and f and g bounded, measurable functions of (Cs,Ct), 
each increasing in Cs and decreasing in Ct- Then on {S ^ T}, 

Ef9 > EfEg. (12) 

Following the Markov chain proof of this, we also give, in Section 12.21 
a different argument, which unfortunately seems only to work when q > 2 
and \S\ = \T\ = 1. So, somewhat strangely, we have separate (and distinct) 
proofs for the cases q = 1 and q > 2, but for the intermediate case 1 < q < 2, 
no alternative to the Markov chain approach. 
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Proof. We first give some additional notation, and state some (well-known) 
lemmas we will need. If F is a subset of F (the set of edges of our graph 
G), and u G ft = {0, 1} E , we write up for the restriction of u to F (up = 
(u e : e G F)), and V(F) for the set of vertices incident with edges of F. We 
continue to use the notation W introduced following (fTUj) . 

Lemma 2.2. For g > 1, the random- cluster measure (fTTj) satisfies the posi- 
tive lattice condition (J8J). 

When g is conditioned on the values of some of the variables u e , the re- 
maining variables are distributed as they would be under the (natural) r.c.m. 
on the graph obtained from G by deleting e's with uj e = and contracting 
those with u e = 1. For our purposes the relevant cases of this are given by 

Lemma 2.3. Let A C V and F C E. The restriction of <PG,q t° {0,1} E ^ F 
under conditioning on either of the events {Ca = F}, {ujp = 0} (i.e. {u e = 
OVe G F} ) is the r.c.m. with parameter q on G — F (the graph obtained from 
G by deleting all edges in F ); more formally, 

VG, q {uE\F = ■ I C A = F) = (p Gtq (uj E \ F = .\uj f = 0) = (f G - F ,q{-)- 

If A, F are as in LemmalP and B C V\V(F), then {C A = F} C {A y> B}; 
so Lemma f2.3l implies 

Lemma 2.4. If A, F are as in Lemma \2. 6 J\. B C V \ V(F), and tp is (tem- 
porarily) ip G ,q conditioned on {A <y4 B}, then 

lf{uJ E \F = ■\C A = F) = <p G ,q{UE\F = ■ I Up = 0) = if G -F,q{')- 

We now turn to the proof of Theorem 12.11 We consider a Markov chain with 
state space Cl consisting of pairs {Cs, Ct) satisfying Q := {S ^ T}. (So the 
states are pairs (C, C) such that C, C' C F; C (resp. C') is a union of paths 
beginning at vertices of S (resp. vertices of T); and V(C) D V{C') = 0.) 

We write tp for the measure ip G ,q conditioned on Q and (p for the measure 
which ip induces on Q. 

Initially our chain is in some fixed state (Cg, Cj) G Cl. Given (C^ _1 , C^T 1 ), 
the state of the chain at time i — 1, we choose (C l s ,C^) in two steps, first 
choosing C % T according to ip conditioned on {Cs = C 1 ^ 1 } — that is, 

Y>T{Ci r = -) = ip{C T = -\Cs = C l s T l ) 
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and then, similarly, C l s according to 



PT(C % s = -) = <p{C s = -\C T = C t T ). 



It is clear that (p is stationary for this chain, and that the chain is irreducible 
and aperiodic; so to prove Theorem 12.11 it's enough to show 

Claim 2.5. For f,g as in the statement of Theorem \2. II and any n, (112)) 

holds for expectation taken with respect to the law of (Cg, Cj) . 

Let XI and Yl be the indicators of the events {e G" C^} and {e G C l s } (e G E, 
i = 0, 1, . . .). These are, of course, not independent, but we will show, using 
the following presumably well-known observation, that they are positively 
associated. 

Lemma 2.6. Suppose Wi,...,W a and Zi,...,Z b are (say) {0, l}-valued 
r.v. 's with joint distribution if) satisfying 

(i) W\, . . . , W a are positively associated; 

(ii) Zi, . . . ,Z b are conditionally positively associated given Wi, . . . , W a ; and 

(iii) for W,W G {0,l} a with W > W, ip(-\W) y *p(-\W), where if){-\W) is 
the conditional distribution of (Zi, . . . , Z&) given (Wi, . . . , W a ) = W . 

Then Wi, . . . , W a , Z\, . . . , Z b are positively associated. 



Proof. Suppose f,g are increasing functions of Wi,...,Z b , and for W G 
{0,l} a , set F(W) = E[f\W] (:= E[f\(W 1 ,...,W a ) = W}) and G{W) = 
E[g\W\. Then 



where the first inequality follows from (ii) and the second from (iii) and 



Efg 



> 



E{E[f\W}E[g\W}} 



> 



EFEG 
EfEg, 



□ 



Lemma 2.7. The collection 



Uj>i U ee s {XI, Yl) 



(13) 



is positively associated. 
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Note this is enough for Claim l2~5l since (trivially) 

Remark 2.8. For each n, C$ is increasing in the variables X\, Y*, and C% 

is decreasing in these variables. 

Proof of Lemma \2. 7| Of course it's enough to show positive association for 



finite subsets of the collection (|13p. We will show by induction on n that for 
each n, each of the collections 

{X l e : e G E, i < n} U {Y* : e G E, i < n} (14) 

and 

{XI : e G E, i < n} U {Y l e : e G 22, z < n} (15) 

is positively associated. (The base cases — those with n = — are, of course, 
trivial.) Actually we just give the argument for (HU), that for (JTSJ) being 
essentially the same. 

We want to apply Lemma l2~7)l with (Wi, . . . ,W a ) = U((X*,y e i ) : e G 
E,i < n) and (Zi,...,Z b ) = (X™ : e G 22), so need to verify conditions 
(i)-(iii) of the lemma. Of course (i) is just our inductive hypothesis, so our 
concern is really with (ii) and (iii). 

Consider a possible value W of (Wi, . . . , W a ), with F the corresponding 
value of Cg -1 . Under conditioning on {(Wi, . . . , W a ) = W}, we have X" fixed 
for e G F (namely X™ e 1 Ve 6 F), while, by Lemma 12.41 the remaining 
X"'s are distributed as the variables l{ e 0c* t ( w )}, where (u e : e G E \ F) is 
chosen according to (fo-F q- Positive association of these variables is given 
by Lemma f2. 21 so we have (ii). 

Now let W be a second possible value of (Wi, . . . , W a ), with W > W 
and F' the corresponding value of C^ -1 . According to Remark 12.81 we have 
F C F' . So (iii) amounts to saying that for F C 2 7 " C 22 and /i any increasing 
function of CV, 

^|Cs = F) > cp(h\C s = F') (16) 

(note h is a decreasing function of the X™'s). But using Lemmas 12 . 31 and 12 . 41 
we may rewrite the left and right hand sides of (fTo^) as 

and 

WG-F<A h ) = ^G-F, q (h\^F'\F = 0); 

and then (fTB*j) follows from Lemma f2.2l ( which gives positive association for 
the measure (p G _p „) ■ □ 
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Remark. We briefly indicate an alternative proof of Theorem 12.11 again 
using a Markov chain and based on a similar idea. This, our original proof, 
is perhaps more natural than that given above, but does not seem as easily 
adapted to prove the directed version of Theorem 11.51 ( Theorem 13. 4|) . 

We again use if for (fa, q conditioned on {S ^ T}. Our chain in this case 
is u°, u 1 , u 2 , . . . drawn from the state space Cl := {uo G Q : S y» T}. Initially 
the chain is in some fixed state a; . Given a;' -1 , the state of the chain at time 
i — 1, we choose uj % in two steps, first choosing an intermediate configuration 
t % according to if conditioned on {Cs = C 's(c</ _1 )} — that is, for ( e Q with 
C S (C) = C s (co^), 

Pr(r* = C) = <p(u = C\C s (u) = Csi^- 1 )) 

— and then, similarly, uj % according to 

Pr(u/ = C) = <p(u = C\CtH = C T {r 1 )). 

It is clear that (p is stationary for this chain, and that the chain is irreducible 
and aperiodic; so to prove Theorem 12.11 it's enough to show 

Claim. For f,g as in the statement of Theorem \2. 1\ and any n, (I12|) holds 
for expectation taken with respect to the law of {u) n ) . 

To prove this we introduce independent r.v.'s X % e1 F e l (e e E, i — 1, . . .), each 
uniform on [0,1], and some fixed ordering, "-<," of E. Then to decide the 
value of t* we compute the conditional probability, say a, that t\ = 1 given 
the values of the uj 1 ' 1 ^ (or just the value of Csiyj 1 ^ 1 )) and those r^'s with 
e 1 -< e, and set t\ = 1 iff X l e < a. For u % we proceed analogously, with the 
requirement for u l e = 1 now being > 1 — a. 

It is then not hard to show, again using Lemmas I2.2H2.41 that (for each 
n) u n is increasing in the variables XI, so that the Claim follows from 
Harris' inequality. 

2.2 A separate proof for q > 2 

As mentioned earlier, it turns out, somewhat curiously, that for q > 2 and 
S and T consisting of single vertices s and t, we can prove Theorem 12. II in a 
different way by exploiting a connection between the random cluster model 
and the "fuzzy Potts model." (The corresponding connection involving the 
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ordinary Potts model again goes back to Fortuin and Kasteleyn.) Before 
doing so, we need to review some classical and more recent facts concerning 
this connection. 

Let q = a+(3 with a, /3 > 0. Using the random-cluster measure uo we generate 
a random spin configuration a G {0, 1} V as follows. 

(i) Choose u G {0, 1} E according to ip q . 

(ii) For each component C of u>, let a take the value 1 (resp. 0) on all vertices 
of C with probability a/q (resp. (3/q), independently of the values of o on 
other components. Let /i Qj/ g denote the distribution of a. 

(In jS] this is called the fractional fuzzy Potts model.) This procedure pro- 
duces a coupling measure V of uj and a, or, rather, of <p q and fi a ,/3- So we 
may also think of first choosing a and then drawing from the conditional 
distribution V(- \ a) to obtain a typical (with distribution ip q ) edge configu- 
ration uj. It is known (and easy to check) that this "reversed" procedure can 
be described as follows. 

(iii) Choose a according to (i a ,p- 

(iv) For i = 1,0, let G(i) = G[a _1 (i)] (the (induced) subgraph consisting of 
vertices in cr _1 (i) and edges of G contained in this set). Set uj e = whenever 
a assigns different values to the ends of e G E, and choose the restrictions of 
uj to E(G(1)) and E(G(0)) (independently) according to v ? G(i),a and <fG(p),/3- 

Furthermore, if in (iii) we choose a according to the conditional distribu- 
tion 

£a,/j(0 = 1 o-(s) = 1, a(t) = 0), 

then uj (in (iv)) has the distribution we want, namely v 9 G,g(' I s */* t); so for 
Theorem 12.11 we may take uj to be chosen in this way. 

The salient points for our purposes are then as follows. (For clarity we 
now add subscripts to the expectation symbol E to indicate measures with 
respect to which expectation is taken.) 

(a) fLemma 12. 2|) For any graph H and c > 1, <fH,c has positive association. 

(b) It is shown in jH] that for any a, (5 > 1, fi — fi a ^ satisfies (jHJ), whence, 
according to the FKG Inequality, fj, a ^ and fi a $ are positively associated. 
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(c) If a, [3 > 1 and / is a function of (C s ,C t ) which is increasing in C s and 
decreasing in C t , then E-p[/ | a] is increasing a. (This follows from ((iv) and) 
(a).) 

Alternate proof of Theorem \2.1\ for q > 2, S = {s} and T = {t}. 

Let /, g be as in the statement of the theorem. Fix some a, (5 > 1 with 
a + (3 = q. For simplicity we write \i for and ip for ip q . The connections 
described above give 

(7 

> J3/i(a)Ep[/(a;)H E^MM 

> ^/i( ff )Ep[/( W )H^/i(a)Epb( W )H 

cr cr 

= E„[/|s<At] E^^ls ^4t], 

where the first inequality follows from (a) (and (iv)) and the second from (b) 
and (c). □ 

3 Directed percolation and contact processes 

In this section we consider another generalization of ordinary percolation: as 
in Section 1 we have a product distribution on {0, 1} E , but now some (or 
all, or none) of the edges of our graph are oriented. There are (at least) 
two natural ways to try to extend the results of Section 1 to this setting, 
corresponding to two possible extensions of the conditioning event {s *^ t}. 
As we will see, both extensions are reasonable for Theorems 11.1111.31 but 
only one of them makes sense for Theorem 11.41 (and Theorem II. 5j) . The 
first set of extensions yield in particular improvements of some of the results 
of Belitsky, Ferrari, Konno and Liggett 2. regarding the contact process 
(defined below). We will first indicate these extensions (proofs of which 
are essentially identical to the proofs of the corresponding statements in 
Section [TJ) and discuss their relevance to the contact process, before turning 
to the second set of extensions. 

We will need the following additional notation. Unoriented edges will 
be denoted by {v,w} and oriented edges by (v,w) (where the orientation 
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is from v to w). When we speak of a path, we will now mean one which 
respects the orientations of its oriented edges. We write {s — > t} for the 
event that there is an open path from s to t and {s -/->■ t} for the complement 
of this event. The open cluster, C s , of s is again the set of all edges contained 
in open paths starting at s. As in Section 1, we fix a vertex s and set 
Rx = {s -/-> x Wx G X} for each X C V \ {s}. Of course all these definitions 
collapse to those of Section^ 11 case there are no oriented edges; so the next 
result contains Theorem ll.il 

Theorem 3.1. With the preceding modified definitions, Theorem \1.1\ holds 
for directed percolation. 

Proof. The proof is essentially the same as that of Theorem 1.1, the only 
difference being that we should now take iV to be the set of those i G" Z for 
which there is at least one edge (i,j) or {i, j} with j G Z, and modify the 
definition of 5* similarly. □ 

Our first extension of Theorem II. 31 to directed percolation is 

Theorem 3.2. Let s G V , X C V \ {s} , and f,g increasing functions of 
C s . Then on {s -f* X}, 

E/0 > E/E0. 

This can be derived from Theorem 13.11 in the same way as Theorem 11.31 was 
derived from Theorem 11.11 It can also be proved using Markov chains (fol- 
lowing either the proof of Theorem 12 .11 or the alternate sketched afterwards), 
where we should now replace Ct (in its various incarnations) by the set of 
edges in paths ending in t. 

Remarks, and consequences for Contact Processes 

(i) Analogously to what we said in Section 1, Theorem 13.21 can be stated 
in terms of (conditional) positive association; namely for any X C V, the 
random variables rj(y) := l{ s ^ y y, conditioned on the event {r/ = on X}, 
are postively associated. 

(ii) Taking A = B = Q in Theorem 13.11 gives 

Pt(R x ) Pt(R y ) < Pt(Rxuy) Pr(iW). (17) 

(iii) Belitsky, Ferrari, Konno and Liggett ([2], Theorem 1.5) proved a special 
case of (J17)) involving a particular graph on the vertex set Z, 2 . Their argument 
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actually applies whenever V admits a partition (Vo = {s}) U V\ U • • • such 
that each edge is directed from V^_i to for some i and X U Y is contained 
in some V^, but does seem to depend essentially on these properties. 

(iv) Much of [2] deals with the contact process on a countable set S. See 
[TUj and |TT] for background on this model; very briefly: Each site (individ- 
ual) in S can be in either of the states 1 (ill and contagious) or (healthy, 
noncontagious). Time is continuous, with i] t (x) denoting the state of site x 
at time t. An infected site x becomes healthy at rate 8 X , and a healthy site 
becomes ill at rate ^ X(x, y)r)(y). Here 5 x ,x G S and X(x, y), x,y G S, are 
the parameters of the model. They are assumed to be non-negative and, if S 
is infinite, to satisfy the following conditions (see [2]): s\ip xeS 8(x) < oo and 

sup xe5 J2 yeS [ X ( x , y) + Kv, x )\ < oo. 

A nice aspect of the model is that it can be viewed in terms of perco- 
lation, via a graphical representation (see e.g. [TT], pages 32-34): being ill 
at some given time corresponds to the existence of an appropriate path in 
space-time. In fact, as is well-known, the process can, by time-discretization, 
be approximated by a directed percolation model. (See the subsection on 
correlation inequalities, in particular page 11, of JT] for the general idea of 
how correlation inequalities for collections of independent Bernoulli random 
variables can be extended to continuous-time interacting particle systems, 
and page 65 of [TT] for a concrete example for the contact process). Com- 
bining this with the present results, one obtains, in a straightforward way, 
contact process analogues of the conditional association property stated in 
(ii) above. In particular this gives the following theorem. 

Theorem 3.3. Suppose (r] t : t > 0) is a contact process as above, with 
deterministic initial configuration t]q. Then for each W C S and t > 0, 
the collection (f] t (x) : x G S \ W) is conditionally positively associated given 
{r) t = on W}. 

(An example of Liggett shows that if we instead condition on {rj t = 1 
on W}, the above positive association need not hold.) 

Suppose now that at time each site is ill. Let v t be the law of r) t 
(= (i]t(x) : x G S)). It is well-known (and follows easily from standard 
monotonicity arguments) that as t — > oo, v t tends to a limit, called the 
upper invariant measure of the process and denoted v. Clearly the preceding 
conditional association property for finite times extends to v. So, if W C S, 
and A and B are events that are determined by, and that are both increasing 
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(or both decreasing) in the rj(x), x E S \ W, then 

u(AB\r] = 0onW) > v(A \ r] = OonW) u(B 1 77 = on W). (18) 

This is a considerable strengthening of a conjecture of Konno ( 9j, Conjecture 
3.4.13), which was proved in — and seems to have been the main motivation 
for — [2] (see inequality (1.3) in [2j), namely: for any K,L C S, 

u(KHL) u(KuL) > u(K)u(L), (19) 

where, for M C S, u(M) := v{ri : r] = OonM}. (Of course (fT9"j) is the 
special case of (fT8|) in which W = K n L, A = {r/ = Ooni^\L} and 
B = {rj = OonL\K}.) 

A directed version of Theorem 11.51 

For a sensible generalization of Theorems ll.4l and ll.5l to the present setting 
we need a different substitute for {s <y4 t}. It is easy to see that neither {s -f* 
t) nor {s y4 t y4 s} will do here (e.g. consider the graph on {s,t,v,a} with 
(oriented) edges (s,v), (t,v), (v,a), and events A = {s — ► a}, B = {t — > a}); 
but there is another natural choice which does work, at least when we assume 
there are no undirected edges. Recall V(F) is the set of vertices incident with 
edges of F. 

Theorem 3.4. Assume G is a digraph in the usual sense (that is, all its 
edges are directed). Let s and t be (distinct) vertices, and f and g bounded, 
measurable functions of (C s , C t ), each increasing in C s and decreasing in C t . 
Then on Q := {V(C S ) n v\c t ) = 0}, 

E/0 > E/E^. (20) 

As mentioned in Section ^ this can be proved along the lines of either 
Theorem II .51 or Theorem 12. 11 with Theorem 13. 21 a crucial ingredient in either 
case. Here we only give (sketchily) the second argument, leaving the reader 
to fill in the first (which, like the second, depends on Observation 13. 5l below). 

It's a little strange that we can so far prove Theorem 13.41 only in the 
absence of undirected edges, and we conjecture that it remains true without 
this restriction. (The difficulties in extending the proof below — those for the 
other version are essentially the same — are the (related) failures of Observa- 
tion 13.51 and of the validity of the hypothesis (iii) when we come to apply 
Lemma 12.61 ) 
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Proof. We will not repeat the proof of Theorem 12 .H but just indicate what 
changes are needed in the present situation. 

The state space Cl and transitions for our Markov chain are essentially as 
before. (Here we have chosen to say C s , C t rather than Cs, Ct, but as noted 
earlier (following Theorem II. 2j) this really makes no difference.) Of course 
{V(C S ) fl V(C t ) = 0} now replaces {S */> T} as the conditioning event Q. 

Let us write ip for our (unconditioned) percolation measure, <p for our 
ip conditioned on Q and (p for the measure which ip induces on Cl. The 
argument here then follows that for Theorem 12.11 verbatim until, in proving 
positive association of the collection (JEJ), we come to establishing conditions 
(ii) and (iii) of Lemma ESI For these we need the easily verified (but crucial) 

Observation 3.5. For U C V the distribution of C s is the same under 
ip conditioned on {V(C t ) = U} as under ijj conditioned on {s -f* U} (and 
similarly with the roles of s and t reversed). 

In view of this, (ii) is an immediate consequence of Theorem 13.21 (the 
relevant information from conditioning on W being just the resulting value 

ofcr 1 ). 

For (iii) we first observe that if W, W are possible values of (W%, . . . , W a ) 
with W < W, and U,U' are the corresponding values of ^(C™ -1 ), then 
according to Remark 12.81 we have U C U' . Moreover, by Observation 13.51 
the distribution of (Z 1; . . . , Z b ) (= (X™ : e G E)) given W is simply the 
distribution of (the indicator of) E\C t under ip conditioned on {t -/-> U}. 
So, writing U for this distribution on (X™ : e G E), we need to show that 
U' ~D U implies (pu 1 >~ <Pu (note that increasing C™ corresponds to decreasing 
the X"'s). This follows from Theorem 13 .21 Notice that tpw is the same as (pu 
conditioned on B := {t y4 JJ' \ U}. (More accurately, <pu> is the distribution 
induced on the indicator of E \ C t by ip(-\t -/^ U) conditioned on B.) But 
then, since B is a decreasing event determined by C t , Theorem 13 . 21 savs that 
under tpu, B is negatively correlated with any increasing event determined 
by C t \ that is, <fu' >- <Pu- D 

Remark. The choice of Cl is a key to the preceding argument. For instance, 
taking the state space to be the analogue of that in the alternative proof 
of Theorem 12.11 sketched at the end of Section |2~T1 — namely {uj G {0, 1} E : 
Q holds for u} — gets in trouble because we lose some positive correlations, 
e.g. of events {u e = 1}. 
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